How to Account for Idiomatic German Support Verb Constructions in Statistical Machine Translation
Abstract
Support-verb constructions (i.e., multiword expressions combining a semantically light verb with a predicative noun) are problematic for standard statistical machine translation (SMT) systems, because these systems cannot distinguish between literal and idiomatic uses of the verb. We work on the German-to-English translation direction, for which the identification of support-verb constructions is challenging due to the relatively free word order of German. We show that we achieve improved translation quality for verb-object support-verb constructions by marking the verbs when they occur in such constructions. Additional evaluations revealed that our systems produce more correct verb translations than a contrastive baseline system without verb markup.
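To make the idea of verb markup concrete, the following is a minimal sketch in Python and not the authors' implementation. It assumes a small hand-made lexicon of (verb lemma, noun lemma) pairs, input that is already tokenized and lemmatized, and a naive same-sentence co-occurrence test in place of the syntactic verb-object analysis a real system would need. The names `SVC_LEXICON`, `MARKER`, and `mark_support_verbs` are hypothetical.

```python
from typing import List, Set, Tuple

# Hypothetical lexicon of (support-verb lemma, predicative-noun lemma) pairs.
SVC_LEXICON: Set[Tuple[str, str]] = {
    ("treffen", "Entscheidung"),  # "eine Entscheidung treffen" = "to make a decision"
    ("stellen", "Frage"),         # "eine Frage stellen" = "to ask a question"
}

MARKER = "_SVC"  # illustrative suffix; the markup used in the paper is not known here


def mark_support_verbs(tokens: List[Tuple[str, str]]) -> List[str]:
    """Take (surface, lemma) pairs for one sentence and return the surface
    tokens, with the marker appended to every verb whose lexicon noun also
    occurs somewhere in the sentence (word order is ignored)."""
    lemmas = {lemma for _, lemma in tokens}
    verbs_to_mark = {
        verb for verb, noun in SVC_LEXICON if verb in lemmas and noun in lemmas
    }
    return [
        surface + MARKER if lemma in verbs_to_mark else surface
        for surface, lemma in tokens
    ]


# Idiomatic use: "treffen" combines with "Entscheidung".
idiomatic = [("Sie", "sie"), ("trifft", "treffen"), ("morgen", "morgen"),
             ("eine", "ein"), ("Entscheidung", "Entscheidung")]
# Literal use: "treffen" means "to meet".
literal = [("Sie", "sie"), ("trifft", "treffen"), ("ihren", "ihr"),
           ("Freund", "Freund")]

print(mark_support_verbs(idiomatic))
print(mark_support_verbs(literal))
```

Because the marked verb becomes a distinct token, a translation model trained on such data can learn separate translations for the idiomatic and literal uses. The co-occurrence test over-generates when the noun appears in the sentence without being the verb's object, which is one reason a parser-based check would be preferable.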
Similar resources
Linguistic Evaluation of Support Verb Constructions by OpenLogos and Google Translate
This paper presents a systematic human evaluation of translations of English support verb constructions produced by a rule-based machine translation (RBMT) system (OpenLogos) and a statistical machine translation (SMT) system (Google Translate) for five languages: French, German, Italian, Portuguese and Spanish. We classify support verb constructions by means of their syntactic structure and se...
Nominalizations in German Support Verb Constructions
Angelika Storrer, TU Dortmund. Support verb constructions (SVC), such as to take a risk or to set into motion, are composed of a support verb (SV, e.g. take, set) and a predicative noun (PredN, e.g. risk, motion), which is – in most cases – a nominalization of a verb or...
Issues in Translating Verb-Particle Constructions from German to English
In this paper, we investigate difficulties in translating verb-particle constructions from German to English. We analyse the structure of German VPCs and compare them to VPCs in English. In order to find out if and to what degree the presence of VPCs causes problems for statistical machine translation systems, we collected a set of 59 verb pairs, each consisting of a German VPC and a synonymous...
Project Proposal: How Verb-Particle Constructions Affect Translation Quality in English and German
This project deals with verb-particle constructions (VPCs) in the context of statistical machine translation, with the purpose of finding out if and to what degree they affect translation quality in English and German. Its main goal is to identify which types of VPCs cause translation errors and to find possible solutions to this problem. VPCs are a type of multiword expression (MWE), meaning...
Modeling verbal inflection for English to German SMT
German verbal inflection is frequently wrong in standard statistical machine translation approaches. German verbs agree with subjects in person and number, and they bear information about mood and tense. For subject–verb agreement, we parse German MT output to identify subject–verb pairs and ensure that the verb agrees with the subject. We show that this approach improves subject-verb agreement...